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The purpose of this study was to examine the validity of the grades 6-8 version of DISCOVER, a performance-based assessment, and 
investigate its effectiveness in identifying gifted minority students. Questions examined the alignment between DISCOVER and 
Gardner’s (1983) theory of multiple intelligences (MI) and assessed gender and ethnic differences. The sample consisted of 395 mid- 
dle school students, predominantly Mexican Americans and Native Americans belonging to lower socioeconomic classes, from schools 
in Arizona. Results supported a good fit between DISCOVER and MI theory. That is, students identified as gifted in one intelligence 
were not necessarily identified in other intelligences. The 2x3 MANOVA (gender by ethnicity) showed no significant interaction or 
main effect for ethnicity. However, a main effect for gender was found with males outperforming females in math. No overall gender 
or ethnic differences in identification were revealed. In total, 12.4% of the participants were identified, suggesting that using DIS- 
COVER might diminish the long-standing problem of minority underrepresentation in gifted programs. 


E ducators have long acknowledged the problem of minor- 
ity students’ underrepresentation in programs for the 
gifted (Baker, 1996; Bernal, 2002; Cummings, 1991; 
Ford & Harmon, 2001; Maker, 1996). The problem is signif- 
icant, especially since underrepresentation has increased over 
the years with respect to certain ethnic groups such as Hacks, 
Hispanics, and IShtive Americans (Ford, Harris, Tyson, & 
Trotman, 2002). Some educators estimate that students from 
these culturally diverse groups are underrepresented by as 
much as 30-70% relative to their percentage in the popula- 
tion (Gabelko & Sosniak, 2002). 

The purpose of this study was to investigate the effective- 
ness of DISCOVER, a performance-based assessment, in iden- 
tifying gifted minority middle school students and reducing 
their underrepresentation in programs for the gifted. DIS- 
COVER is an acronym that stands for Discovering Intellectual 
Strengths and Capabilities through Observation while allowing 
for Varied Ethnic Responses (Maker, Nielson, & Rogers, 
1994). 


Reasons for Underrepresentation 

Ford et al. (2002) argued that the main reason for minor- 
ity underrepresentation is a “deficit perspective” that has influ- 
enced directly or indirectly the access of culturally diverse 
students to gifted programs. Educators who hold this perspec- 
tive assume that students from diverse and economically disad- 
vantaged populations are cognitively inferior because many 
score low on standardized tests and fail to meet the traditional 
criteria for placement in gifted programs, that is, scoring on the 
97th percentile or above. One symptom of the cognitive deficit 
hypothesis is the narrow definition of intelligence and gifted- 
ness adopted in most schools. A major limitation of these def- 
initions is that they do not take into consideration cultural 
factors in conceiving and demonstrating superior cognitive 
abilities (Bernal, 2002). As a result of the widespread belief in 
these traditional conceptions, identification procedures in most 
school districts (about 90%) still rely heavily on the scores of 
standardized tests, a practice that limits the access of culturally 


61 



Sarouphim 


diverse students to programs for the gifted and keeps the demo- 
graphics of these programs mostly White (Ford & firmon, 
2001; Ford et ah)- 

Alternative Assessments 
for Gifted Identification 

Many educators have advocated replacing standardized tests 
with a new breed of instruments, namely authentic assessment, 
also called alternative and performance-based assessment (Ford 
& Fdarmon, 2001; fvhker, 1996, Sarouphim, 1999). The rise 
of authentic assessment (e.g., Borland & Wright, 1994; Clasen, 
Middelton, & Connell, 1994; Flafenstein & Tucker, 1994; 
fvhker, 1992; Reid, Udall, Romanoff & Algozzine, 1999) has 
coincided with the emergence of alternative theories of intelli- 
gence, such as Gardners (1983) theory of multiple intelligences 
(MI) and Sternbergs triarchic theory of intelligence (1986). 
Along the same lines, recent conceptions of giftedness have also 
played a major role in promoting the development of authentic 
assessment. For example, Maker (1993) defined giftedness as 
“the ability to solve complex problems in effective, efficient, ele- 
gant, and economical ways” (p. 71). 

Research on alternative assessment has revealed that, more 
often than not, minority students fare better on these mea- 
sures than on traditional standardized tests (Borland & Wright, 
1994; Clasen et ah, 1994; Reid et ah, 1999; Sa rouphim, 2001), 
a factor that earned authentic assessment the reputation of 
being bias-free and expanded its use among minority groups. 
Fdowever, authentic assessment is not without its drawbacks. 
Among the many criticisms of these measures are their high 
cost, domain underrepresentation, lack of sound psychomet- 
ric properties, and their long and laborious administration 
(Dunbar, Koretz, & Hoover, 1991; Frechtling, 1991). 

Evidence Against the Use of Alternative Assessments 

Research on the effectiveness of alternative assessment has 
yielded mixed results, a factor that has led some reseanhers to 
warn against the use of these measures. For example, Plucker, 
Callahan, and Tomchin (1996) did not find evidence support- 
ing the sound psychometric pro p e rties of a battery of alterna- 
tive instruments based on the theory of multiple intelligences. 
Participants {N = 1,813) were assessed using the Mtltiple 
Intelligence Assessment Technique, based on the work of 
Project Spectium and local modifications of the DISCOVER 
assessment. Student performance was rated as “not evident or 
not observed,” “evident,” or “extremdy evident.” The results 
showed high internal consistency for the scales’ scores, and 
factor analysis confirmed the presence of the linguistic and log- 
ical-mathematical subscales, but the presence of the spatial 


and interpersonal scales we re not supported. Generally, the cor- 
relations between the different subscales we re sufficiently low 
to provide evidence for discriminant "validity, but a relatively 
high correlation was found between the ITBS language sub- 
scale and the math performance assessment, as well as between 
the math checklist and the linguistic checklist. No significant 
gender or ethnic differences were found. 

The study raised questions about the structural and con- 
struct validity of Ml-based assessments, pointing to the 
methodological difficulties of assessing the psychometric prop- 
erties of such instruments. The researchers concluded that 
although “MI theory and alternative assessments may hold 
substanti"veimplications for education of gifted students . . . 
educators using MI theory, alternative assessments, and com- 
binations of the two should subject the programs to rigorous 
evaluation” (Plucker, Callahan, & Tomchin, 1996, p. 87). 

Evidence Supporting the Use of Alternative Assessment 

In a study of whether alternative assessment might dimin- 
ish the problem of underre p resentation of minority students in 
programs for the gifted, Reid et al. (1999) compared a tradi- 
tional measure, the Matrix Analogies Test— Short Form (MAT- 
SF), with an alternate assessment, the Problem Solving 
Assessment (PSA), in identifying culturally and linguistically 
gifted students. The PSA is a measure grounded in Gardner’s 
theory of multiple intelligences and the conceptual framework 
of the DISCOVER assessment. It consists of tasks designed to 
measure linguistic, spatial, and logical-mathematical intelli- 
gences. To be recommended for placement in a gifted pro- 
gram through the use of PSA, students need a rating of “always 
evident” or “strongly evident” problem-solving behaviors in 
two or more of the intelligences assessed. A stanine perfor- 
mance of 9 in the MAT-SF was used as the criterion for iden- 
tification. 

The sample consisted of 600 students of White, Hack, 
Hispanic, American Indian, and Asian origin. The results 
showed that, through the use of MAT-SF, only 22% of the stu- 
dents met the criteria for identification, whereas about half of the 
students were identified as gifted through the use of the PSA. 
However, such a high percentage of identified students is ques- 
tionable and does not in itself validate the assessment. Also, sig- 
nificant differences were found in the distribution of identified 
students. Using the MAT-SF, 1 1% of identified students were 
minority, whereas the PSA recommended 39% of minority stu- 
dents for placement in gifted programs. However, the study has 
an inherent weakness: the lack of data on the predictivevalidity 
of the PSA. Also, a similarly high percentage of identified stu- 
dents using the MAT-SF might result by lowering the cut-score 
used. Despite these limitations, the researchers concluded that 
available anecdotal evidence on the predictive validity of the PSA 
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was favorable: Most students who we re placed in programs for 
the gifted using the PSA were successful. 

In another study, Hafenstein and Tucker (1994) assessed 
the effectiveness of a nontraditional assessment for measuring 
multiple intelligences. Trained observers assessed 3-, 4-, and 5- 
year-old children as they worked on tasks in the seven intelli- 
gences. Collected data included interviews with teachers and 
reports from the students’ parents. Bollowng the assessment, 
observers classified children’s abilities as “not evident,” “evi- 
dent,” and “extremely evident” (i.e., gifted). In mid-year, 
teachers we re asked to rate the children using the same classi- 
fication. Mean differences between the teachers’ two ratings 
were minimal; the first mean was 3.67 and the second was 
3.74. 

Also, content analysis re vealed a great similarity betwe e n 
observers’ and teachers’ ratings, as well as between the par- 
ents’ and teachers’ reports, suggesting that the use of the 
assessment led to adequate placement of children. Interviews 
with parents indicated that they believed that their children 
were well placed and thriving in the gifted program. 
Interviews with teachers showed similar results. Regression 
analysis suggested that the beginning-of-year assessment was 
predictive of future performance, AfTdd) = 4.95 1,/* < .03. 
The researdiers concluded that the performance-based assess- 
ment used in this study was an effective process for identifying 
young gifted children. 

Research on DISCOVER 

The alternative assessment, DISCOVER, which is the 
focus of this study, is grounded in MI theory and based on 
Ahker’s (1993) definition of giftedness. The assessment was 
designed to reflect Gardner’s (1983) description of the “cote 
capabilities” of each intelligence. DISCOVER was developed 
to identify gifted students from culturally diverse groups. Since 
its inception, DISCOVER has been administered to thousands 
of students from diverse populations. The data collected have 
served as the basis for research on the reliability and validity of 
the instrument. 

A series of studies were conducted to examine the psycho- 
metric properties of the K-2, 3-5, and 9-12 versions of DIS- 
COVER. These studies focused on the alignment berween 
DISCOVER and MI theory, gender differences, ethnic differ- 
ences, and concurrent validity of DISCOVER with the Raven’s 
Progressive Matrices. Even though other research on DIS- 
COVER exists, this section will be limited to the review of 
studies on the aforementioned questions because their focus 
was similar to that of the present investigation (for a thorough 
review of research on DISCOVER, see Sarouphim, 2002). 
However, one additional study on the interrater reliability of 
DISCOVER is worth mentioning here. Griffiths (1996) com- 


pared the ratings observers gave to students on the spatial activ- 
ities and those marked by independent raters who watched 
videotapes of the recorded administration. The results showed 
high interrater agreement, ranging from 80% to 100%, with 
the highest agreement found between the observers and inde- 
pendent raters with the most expertise in the administration 
of DISCOVER. 

Fit between DISCOVER and MI theory. Sarouphim (2000) 
investigated the alignment of DISCOVER with the theory of 
multiple intelligences through a series of interobserver correla- 
tions. The sample consisted of 254 elementary students, pre- 
dominantly from economically disadvantaged Native 
American and Hispanic groups. All participants took either the 
K— 2 or the 3—5 version of DISCOVER, depending on their 
grade level. The results showed low interobserver correlations 
across grade levels between the activities that measured differ- 
ent intelligences (e.g., linguistic and spatial activities) and mod- 
erate to high correlations between activities that measured 
related intelligences (e.g., oral linguistic and written linguistic), 
indicating that students who we re identified as gifted in one 
intelligence we re not necessarily identified as gifted in the other 
intelligences. The results suggested that the different DIS- 
COVER activities with distinguishable cognitive tasks may 
measure different intelligences, a finding that might provide 
suppottto the consistency between DISCOVER and Gardner’s 
MI theory. 

Concurrent validity . In another study, Sarouphim (2001) 
examined the concurrent validity of DISCOVER with the 
Raven’s PtogressiveMatrices. The study also examined gender 
differences in identification rates and the percentage of minor- 
ity students identified by DISCOVER and traditional stan- 
dardized tests. The results, which we re based on a sample of 
Native American and Hispanic students, showed a high corre- 
lation between the students’ scores on the Raven’s and their rat- 
ings in the spatial activities of DISCOVER and low 
cor relations between the students’ Raven scores and their rat- 
ings in the linguistic activities of DISCOVER, providing evi- 
dence for the convergent and discriminant validity of 
DISCOVER. Using the criterion of a “definitely” rating in at 
least two of the activities, the results also showed that, through 
the use of the DISCOVER assessment, 22.9% of the students 
we re identified. In addition, no significant gender differences 
we re found in identification, possibly indicating that the assess- 
ment is mostly fair and does not discriminate against gender 
or ethnicity. 

Gender and ethnic differences. Enally, Sarouphim (2002) 
investigated the effectiveness of the 9—12 version of DIS- 
COVER. The sample consisted of 303 ninth graders, predom- 
inantly Hispanic and Native American students. The results 
provided evidence for an alignment of the assessment with the 
theory of multiple intelligences. Also, no overall gender or eth- 
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nic differences we re found in identification. In addition, the 
results suggested that the use of the DISCOVER assessment 
might help in reducing the problem of minority students’ 
underrepresentation in programs for the gifted, as 29.3% of the 
high school students who participated in the study we re iden- 
tified as gifted. 

The Current Study 

The purpose of this study was to examine the validity of 
the 6-8 version of DISCOVER and investigate the effective- 
ness of this assessment in identifying gifted middle school stu- 
dents from culturally diverse groups. Three questions guided 
this inquiry: (1) Are DISCOVER and MI theory aligned? That 
is, does DISCOVER tap into the different intelligences as 
identified by Gardner? (2) Do gender differences appear 
through the use of DISCOVER? (3) Do ethnic diffeiences 
appear through the use of DISCOVER? 

Method 

Participants 

The sample consisted of 395 male (54%, n = 214) and 
female (46%, n = 81) sixth, seventh, and eighth graders from 
18 schools in Arizona. Participants were mostly of low socioe- 
conomic status, as evidenced by their place of residence and 
their participation in their school’s free lunch program 
(85%). Participants we re Native Americans from the Navajo 
tribe (47%, n = 186) attending 8 schools in northern Arizona 
and Mexican Americans (38.5%, n = 151) and White 
Americans (14.5%, n = 58) attending 10 schools in southern 
Arizona. The mean ages were 13.1, 13.3, and 13.6 years for 
White, Mexican American, and Native American students, 
respectively. The mean age of females was 13.1 years and 13.2 
years for males. 

Instrument 

DISCO\ER is a performance-based assessment consist- 
ing of five activities designed to measure individuals’ problem- 
solving abilities in each of the following intelligences: Pablo® 
(spatial), Tangrams (spatial/logical-mathematical). Math (log- 
ical- mathematical). Storytelling (oral linguistic), and 
Storywriting (written linguistic). Each activity proceeds 
through a series of tasks that progress from structured to more 
open or “fuzzy” problems. To avoid observer bias, observers 
rotate at the completion of each activity so that each student is 
assessed only once (i.e., during one activity only) by the same 


observer. The following is a brief description of each activity 
(for a thorough description of DISCOVER, see Maker, 1 992, 
and Sa rouphim, 1999). 

Pablo. The material for this activity consists of colored 
cardboard pieces of different shapes, designs, and sizes. 
Students are asked to make different constructions (e.g., geo- 
metrical designs, a container, a machine, and a construction 
of their choice) using the Pablo pieces. 

Tangrams. Each student is given a set of Chinese Tangrams 
(21 pieces of three different shapes: triangles of three different 
sizes, squares, and parallelograms). Students are requested to 
make a parallelogram using as many Tangram pieces as possi- 
ble. Then, each student is given a booklet of six puzzle sheets 
arranged in ascending order of difficulty and asked to solve 
them. Students who complete this task are then provided with 
“Challenge Sheets,” which consist of more difficult puzzle 
problems. 

Storytelling. Students are given an array of toys and are 
asked to describe one and then two of their toys using as many 
descriptors as possible. Then, students are asked to tell a story 
of their choice that incorporates some or all of the toys they 
have been given. 

Storywriting. Students are asked to produce a written piece 
of their choice (story, poem, etc.) about a topic of their choice. 

Math. Worksheets consisting mostly of open-ended 
numerical problems are used to assess this intelligence. The 
problems increase in openness and difficulty, with the last 
problem consisting of creating as many problems as possible 
(fractions, multiplication, division, etc.) having a prespecified 
number as the answer. 

Procedures 

All participants were assessed through DISCOVER for 
identification purposes. Trained observers administered the 
spatial, logical-mathematical, and verbal linguistic activities 
in class and took notes while students worked in groups of 
four or five students, with a ratio of 1:4 or 1:5 (one observer 
to four or five students). All observers completed a 2-day 
training during which they learned about the conceptual and 
applied aspects of the assessment, followed by at least four in 
vivosessions during which they “shadowed” expert observers 
and watched them in action. Observers were all holders of 
doctoral degrees in education or graduate students pursuing 
advanced degrees in education. In most classes, teachers read 
the instructions, but, in a few instances where teachers pre- 
ferred not to take part in the process, one of the DISCOVER 
observers gave the instructions. The possible source of error 
that might result from an observer rather than the classroom 
teacher giving instructions is kept minimal since the DIS- 
COVER administration procedures are standardized. For 
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Table 1 

Observers'^ Interrating Correlations 


Activity 

1 

2 

3 

4 

5 

Pablo 

_ 

0.239** 

0.267“ 

0.074 

0.012 

Tangrams 

- 

- 

0.102 

0.084 

0.083 

Math 

- 

- 

- 

0.183' 

0.023 

Story 

- 

- 

- 

- 

0.258" 

Writing 

- 

- 

- 

- 

- 


Note. *p < 0.05- **p < 0.01 


accuracy purposes, the administration sessions were video- 
taped and the students’ stories were tape-recorded and tran- 
scribed verbatim at a later time. A day preceding or following 
the group activities, the students worked individually on the 
written linguistic and math worksheet components of DIS- 
COVER. 

Followng data collection, the DISCOVER observers met 
and discussed the students’ performances. They then ranked 
students’ performances in each of the activities according to a 
4-category rating scale of “unknowi,” “maybe,” “probably,” 
and “definitely,” with the last rating category being the high- 
est and corresponding to superior problem-solving ability or 
giftedness. Usually, students given the “definitely” rating cate- 
gory in at least two of the activities are identified as gifted; how- 
ever, the identification criteria are flexible to a certain extent 
(e.g., in some schools, students given three “definitely” ratings 
are identified as gifted) and depend on the school district iden- 
tification procedures, as well as the breadth of programs for the 
gifted offered at each particular school (Sarouphim, 2001). 
However, in this study, the criterion for identification used was 
set at a rating of “definitely’in at least two of the activities. 
Data we re coded as follows: 1 for “unknown,” 2 for “maybe,” 3 
for “probably,” and 4 for “definitely.” 

To assign a rating, observers are guided by a checklist 
they complete for each child. Items on the checklist repre- 
sent superior problem-solving behaviors (process) and char- 
acteristics of products. For example, in Pablo, observers note 
how the final construction was produced and whether the 
constructions are three-dimensional, complex, and original 
and whether they incorporate many pieces. In Tangrams, 
observers note the number of puzzle sheets solved, the strate- 
gies used (e.g., fitting pieces without physically rotating 
them), the time it takes students to solve them, and the 
number of Tangram pieces used to complete a square or a tri- 
angle. In Storytelling and Storywriting, observers look for 
fluency, well-constructed plots, appropriate sequence of 
events, and the quality of words and sentences. In Math, 
strategies (e.g., creating problems and solving them), as well 


as the number of problems solved, are taken into considera- 
tion. Even though the checklist constitutes an important ele- 
ment in ranking students, it is not considered exclusive in 
the decision-making process. That is, the observers’ classifi- 
cation is also based on noting superior problem-solving 
behavior not included in the checklist. The rationale is that 
giftedness is not limited to a few behaviors, so observers need 
to keep an open mind and note all performance that might 
denote superior ability. 

Results 

To assess the alignment of DISCOVER with MI theory, 
the ratings given to students by trained observers in the five 
DISCOVER activities were correlated, yielding a matrix of 
observers’ interrating correlations. For gender and ethnic dif- 
ferences in activity (i.e., per intelligence), a 2 x 3 MANOVA 
(gender by ethnicity) and follow-up ANOVA we re calculated, 
whereas for gender and ethnic differences in identification (i.e., 
the number of identified students per gender and ethnicity), 
the chi-square statistic was used. 

Alignment of DISCOVER With Ml Theory 

As seen in Table 1 , all correlations were low, though some 
we re significant. The highest correlation was found between 
the two activities of Math and Pablo, r = .267, p < .01, and 
the lowest between the two activities of Math and Sorywriting, 
r = .023, ns. A significant correlation was found between the 
two activities of Stoiyrelling and St o lywriting, r= .258,/) < .01, 
both measuring linguistic intelligence. The results suggest that 
students’ ratings (i.e., performance) were comparable in the 
activities that assessed a similar intelligence and discrepant in 
the activities that assessed different intelligences. 

Gender and Ethnic Differences by Activity 

Table 2 shows the students’ mean ratings and standard 
deviations in all activities. The means ranged between “maybe” 
and “probably,” with White males obtaining the highest mean 
ratings in the Math activity and Native American males obtain- 
ing the lowest mean ratings in the Storywriting activity. The 
multivariate analysis of variance showed no significant inter- 
action effect between gender and ethnicity, F{6,562) = 1.61,/) 
> .05. Smilarly, the results revealed no significant main effect 
for ethnicity, F{6, 562) = 1.03,/) > .05; however, a significant 
main effect for gender was found, Aid, 541) = 9.67, p < .01. 
Post-hoc univariate analysis showed that males outpoformed 
females in fvhth, T'(l,373) = 7.57,/) < .01, yielding an effect 
size of .69. 
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Table 2 

Students'^ Mean Ratings and Standard Deviations by Gender and Ethnicity 



Mexican American 

Native American 

White American 

All 

Pablo 

M 

2.65 (0.97) 

3.01 (0.84) 

2.90 (0.73) 

2.85 (0.84) 

F 

2.71 (0.91) 

2.79 (0.89) 

2.84 (0.80) 

2.78 (0.86) 

All 

2.68 (0.94) 

2.90 (0.86) 

2.87 (0.76) 

2.81 (0.85) 

Tangrams 

M 

2.65 (0.86) 

2.58 (0.72) 

2.70 (0.91) 

2.64 (0.83) 

F 

2.71 (0.91) 

2.60 (0.81) 

2.65 (0.79) 

2.65 (0.83) 

All 

2.68 (0.88) 

2.59 (0.76) 

2.67 (0.85) 

2.64 (0.83) 

Math 

M 

2.97 (0.65) 

3.01 (0.98) 

3.11 (1.02) 

3.03 (0.88) 

F 

2.54 (0.88) 

2.48 (0.79) 

2.63 (1.13) 

2.55 (0.93) 

All 

2.75 (0.76) 

2.74 (0.88) 

2.87(1.07) 

2.79 (0.90) 

Storytelling 

M 

2.92 (0.87) 

2.85 (0.71) 

3.05 (0.87) 

2.94 (0.81) 

F 

2.98 (0.93) 

2.71 (0.89) 

3.02 (0.92) 

2.90 (0.91) 

All 

2.95 (0.90) 

2.78 (0.80) 

3.03 (0.89) 

2.92 (0.86) 

Storywriting 

M 

2.51 (0.76) 

2.36 (0.84) 

2.43 (0.73) 

2.43 (0.77) 

F 

2.74 (0.82) 

2.64 (0.88) 

2.86 (0.81) 

2.74 (0.83) 

All 

2.62 (0.79) 

2.50 (0.86) 

2.64 (0.77) 

2.58 (0.80) 


Note. Unknown = 1 , Maybe = 2, Probably = 3, Definitely = 4. 


Gender and Ethnic Differences by Identification 

The number of males and females identified as gifted (i.e., 
given at least two “definitely” ratings) in the DISCOVER activ- 
ities was calculated. The results revealed that the number of 
identified males was not significantly higher than the number 
of identified females (see Table 3). The Mexican American 
group had the highest percentage of identified students, fol- 
lowed by Native Americans and Whites. The chi-square test 
showed no significant overall gender or ethnic differences in 
identification, (2, 94) = .346, ns. In total, 12.4% of partici- 
pants were identified as gifted. 

Discussion 

The purpose of this study was to examine the validity of 
the grades 6-8 version of DISCOVER, a performance-based 


assessment, and investigate its effectiveness in identifying gifted 
minority students. The research questions focused on the align- 
ment between DISCOVER and MI theory and an examina- 
tion of gender and ethnic differences. The results showed a 
good fit between DISCOVER and MI theory and an absence 
of significant gender and ethnic differences in identification. 
However, the univariate analysis showed that males outper- 
formed females in the Math activity of DISCOVER. Hnally, 
the percentage of students identified as gifted through the use 
of DISCOVER was higher than the traditional 3% designated 
through the use of traditional standardized tests. 

In this study, the low observers’ interrating correlations 
indicated that a student who was given a high rating in one 
intelligence (i.e., identified as gifted) was not necessarily given 
the same high rating in the other intelligences. In other words, 
the findings re veal that DISCOVER might tap into a variety of 
intelligences, suggesting a good fit between DISCOVER and 
MI theory. On the other hand, the finding that all correlations 
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Table 3 

Gifted Participants 
by Gender and Ethnicity 


Mexican Native White Total 

American American American 



All 

Gifted 

All 

Gifted 

All 

Gifted 

All 

Gifted 

Males 

78 

11 

101 

13 

35 

4 

214 

28 

Females 

73 

9 

85 

10 

23 

2 

181 

21 

Total 

151 

20 

186 

23 

58 

6 

395 

49 


were low, including those found significant, is problematic. 
According to Gardner (1983), some of the intelligences are 
tightly related, such as spatial and logical-mathematical, which 
he called “twin” intelligences, so the correlations in ratings 
given to students in Pablo, Tangrams, and Math were expected 
to yield moderately high values, rather than the relative^ low 
values found in this study. The same reasoning applies to the 
activities of Storytelling and Storywriting, which both mea- 
surelinguistic intelligence. Also, statistical significance of the 
correlations in this study might be due to the lelativdy large 
sample size, rather than a true correlation in the population. 
In further studies with similar analyses, further exploration of 
this issue is needed to explain this problematic finding. 

Moreover, the activities included in DISCOVER do not 
cover the whole spectrum of Gardners recogni 2 Bd intelligences, 
namely, bodily-kinesthetic, musical, naturalist, existentialist, 
and the personal intelligences. The rationale for focusing on 
spatial, mathematical, and linguistic intelligences is that a good 
match must exist between an assessment and a placement pro- 
gram and, at this time, programs for the gifted emphasize 
mostly these three intelligences. However, one recommenda- 
tion that stems from this study is that, for DISCOVER to be 
faithful to MI theory, all the intelligences must be assessed. 
Therefore, activities for all intelligences need to be developed. 
Similarly, another recommendation is that school officials who 
intend to use DISCOVER for placement purposes need to be 
made aware of the limitations of the assessment, especially if a 
discrepancy exists between the focus of the gifted program and 
that of DISCOVER. 

An interesting finding was the absence of gender and eth- 
nic differences in identification. Educators have long deplored 
the ethnic bias in identification (Bernal, 2002; Ford & 
Harmon, 2001). The absence of such a bias in this study, as 
well as the high percentage of identified students, suggests that 
the use of DISCOVER might help in reducing the problem 
of minority underrepresentation in programs for the gifted. 


This finding is compatible with the results of other studies in 
which performance-based assessments were used for identifi- 
cation purposes (Borland & Wright, 1994; Clasen et al., 1994; 
Hafenstein & Tucker, 1994; Reid et al., 1999). In these stud- 
ies, the final pool of identified students was larger than that 
usually found through the administration of standardized tests. 

However, the identification of a large number of minority 
students does not justify in and ofitself the use of DISCOVER 
or any other performance-based assessment. Rather, the use of 
an instrument must be justified by evidence of its effectiveness. 
Even though preliminary reseanh results provide evidence for 
the high reliability and validity of DISCOVER, data on the 
piedictivevalidity of the assessment, which would be a better 
indicator of its effectiveness, are still missing. At this time, a 
longitudinal study is underway to follow the academic progress 
of a group of Native American students (the “Step Up” group) 
who we re all identified through DISCOVER and placed in dif- 
ferent programs for the gifted. But, until the results are out, 
solid conclusions on the predictive validity of DISCOVER 
cannot be drawn. 

A noteworthy finding is that White students performed 
as well as minority participants in the DISCOVER activities, 
which might indicate that DISCOVER could be used for the 
identification of majority students, as well. However, the sam- 
ple of White students in this study was too small to warrant 
such an assumption. In further studies, larger numbers of 
White students must be included in the research sample to 
clarify this issue. 

The finding that males received higher ratings in the Math 
activity is congruent with a body of res earth on gender differ- 
ences in mathematical intelligence (e.g., Lubinski & Benbow, 
1992). In previous studies on the K-2, 3-5, and 9-12 ver- 
sions of DISCOVER, gender differences in the Math activity 
were not found (Sarouphim, 1999, 2002). One explanation 
might be that the Math activity in the middle school version 
of DISCOVER is more sensitive to students’ strengths and taps 
better into this intelligence. Such an explanation entails a revi- 
sion of the Math activity in the other versions of DISCOVER 
because, if a difference truly exists in the population, an assess- 
ment that shows such a difference has more valid scores than 
one that does not. However, other explanations specific to the 
population of middle school students might be considered. For 
example, during adolescence, girls start showing interest in 
boys and their behavior becomes gender-specific. That is, they 
are more careful to abide by behaviors labeled “feminine” in the 
cultuie(Lynch, 1991). Traditionally, superiority in math has 
been associated with males (Eccles, 1993), and, consequently, 
an outstanding performance in math might not be considered 
“feminine,” which might explain the drop in the girls’ math 
performance in the 6—8 version of DISCOVER. However, fur- 
ther research is needed to explain such a finding. 
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Moreover, in previous teseardi on DISCOVER, the per- 
centages of identified students were higher (i.e., 22.9% in K-5 
and 29.3% in 9—12). An explanation of this finding might be 
grounded in the population studied. Middle school is the 
bridge between elementary and secondary school and consti- 
tutes a period of academic transition (Coleman, 2001). 
feseardiers have long noticed a drop in the academic perfor- 
mance of middle school students. Some have attributed this 
decline to a traditional classroom environment that does not 
match early adolescents’ needs and interest (Eccles, Wigfield, 
Mdgley, Reuman, hfeclver, & Feldlaufer, 1992). Therefore, 
the lower percentage of identified students in this study might 
reflect the general academic decline that accompanies the tran- 
sition from elementary to middle school. However, the reason 
might also be related to the specific tasks of the 6-8 version of 
DISCO\ER. Again, further examination of the activities in 
DISC OVER through a thorough content analysis of the tasks 
incorporated is needed to clarify this point. 

The results of this study provide positive evidence for the 
use of the DISCOVER assessment with culturally diverse 
groups, namely Mexican Americans and Native Americans. 
The implications are of significance to practitioners and 
researchers alike who have been striving relentlessly to find 
means for rectifying the long-standing injustice of minority 
underrepresentation in programs for the gifted. Thus, DIS- 
COVER can be used for identification purposes, especially for 
the placement of students in gifted programs that match the 
theoretical bases of the assessment (i.e., multiple intelligences). 
A paradigm shift in assessment procedures can contribute sig- 
nificantly to the solution of minority students’ underrepresen- 
tation in gifted programs. In the continuous struggle to 
establish equity in gifted education, the use of authentic assess- 
ment seems to be promising. Instruments such as DISCOVER 
can greatly contribute to diversifying gifted education pro- 
grams. Until that goal is reached, educators “should and can 
exercise greater vigilance in pursuit of educational values that 
will move alternative assessment reform in educationally rig- 
orous, equitable, and sustainable directions” (Hargreaves, Earl, 
& Schmidt, 2002, p. 93). 
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